Parallel Tree N-body Code: Data Distribution and DLB on the CRAY T3D for Large Simulations
نویسندگان
چکیده
We describe a strategy for optimal memory and work distribution. We have performed a series of tests to find an optimal data distribution in the Cray T3D memory, and to identify a strategy for the Dynamic Load Balance (DLB). The results of tests show that the step duration depends on two main factors: the data locality and the network contention. In a very large simulation, due to network contention, an unbalanced load arises. To remedy this we have devised an automatic work redistribution mechanism which provided a good DLB.
منابع مشابه
Ocean Modeling and Visualization on Massively Parallel Computer
Climate modeling is one of the grand challenges of computational science, and ocean modeling plays an important role in both understanding the current climatic conditions and predicting future climate change. Three-dimensional time-dependent ocean models require a large amount of memory and processing time to be run for realistic simulations. Recent advances in computing hardware, particularly ...
متن کاملA Work- and Data Sharing Parallel Tree N-body Code
We describe a new parallel N-body code for astrophysical simulations of systems of point masses interacting via the gravitational interaction. The code is based on a workand data sharing scheme, and is implemented within the Cray Research Corporation’s CRAFT c © programming environment. Different data distribution schemes have been adopted for bodies’ and tree’s structures. Tests performed for ...
متن کامل2 00 3 FLY . A parallel tree N - body code for cosmological simulations . Reference Guide
FLY is a parallel treecode which makes heavy use of the one-sided communication paradigm to handle the management of the tree structure. In its public version the code implements the equations for cosmological evolution, and can be run for different cosmological models. This reference guide describes the actual implementation of the algorithms of the public version of FLY, and suggests how to m...
متن کاملA Simple Synchronization and Communication Multi-threaded Library for Automatic Distribution of Irregular Sequential Code
We present a data-driven parallelization technique for sequential programs. The main originality of our approach is an absorption mechanism using specific wait/signal synchronizations based on special purpose counters called structural clocks. This mechanism allows to perform unmatching send/receive operations. Our parallelization technique can express unpredictable communications involved by i...
متن کاملAre You Ready to FLY in the Universe ? A Multi-platform N-body Tree Code for Parallel Supercomputers
In the last few years, cosmological simulations of structures and galaxies formations have assumed a fundamental role in the study of the origin, formation and evolution of the universe. These studies improved enormously with the use of supercomputers and parallel systems, allowing more accurate simulations, in comparison with traditional serial systems. The code we describe, called FLY, is a n...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998